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The crystal structure of the phosphoryiated, activated 
form of the insulin receptor tyrosine kinase in complex 
with a peptide substrate and an ATP analog has been 
determined at 1.9 A resolution. The activation loop 
(A-loop) of the kinase undergoes a major conform- 
ational change upon autophospborylation of iyrll58, 
Tyrll62 and Tyrll63 within the loop, resulting in 
unrestricted access of ATP and protein substrates to 
the kinase active site. Phosphorylated Tyrll63 
(pT^Tll63) is the key phosphotyrosine in stabilizing 
the conformation of the tris-phospborylated A-loop, 
whereas pTyrll58 is completely solvent-exposed, sug- 
gesting an availability for interaction with downstream 
signaling proteins. The YMXM-containing pepdde 
substrate binds as a short anti-parallel p-strand to the 
C-terminal end of the A-loop, with the methionine side 
chains occupying two hydrophobic pockets on the 
C-terminal lobe of the kinase. The structure thus 
reveals the molecular basis for insulin receptor activ- 
ation via autophospborylation, and pro\ides insights 
into tyrosine kinase substrate specificity and the 
mechanism of phosphotransfer. 
Keywords: autophosphorylation/insulin receptor/protein 
tyrosine kinase/substrate specificity/ X-ray 
crystallography 



Introduction 

Insulin stimulates numerous intracellular signaling path- 
ways that regulate cellular metabolism and growth (White 
and Kahn, 1994). The physiological effects of insulin are 
mediated by its cell surface receptor, an 0,2^2 transmem- 
brane glycoprotein with intrinsic protein tyrosine kinase 
activity (Ebina et al, 1985; Ullrich ei ai, 1985). Binding 
of insulin to the extracellular a-chains results in autophos- 
phorylation of specific tyrosine residues in the cytoplasmic 
portion of the jJ-chains: two in the juxtamembrane region, 
three in the kinase (catalytic) domain, and two in the 
C-terminal tail (Tomqvist et ai, 1987; Tavare et al., 1988; 
White et al, 1988; Feener et al, 1993; Kohanski, 1993). 
Autophospborylation of Tyrll58, Tyrll62 and Tyrll63 
in the activation loop (A-loop) of the kinase domain is 
critical for stimulation of kinase activity and biological 
function (Rosen et al, 1983; Ellis et ai. 1986). 

The vast majority of receptor tyrosine kinases contain 
between one and three tyrosine residues in the kinase 
A-loop, which comprises subdomains VII and VIII of the 



protein kinase catalytic core (Hanks et al, 1991). In 
addition to the insulin receptor, other receptor tyrosine 
kinases whose biological function has been shown to 
depend on tyrosine autophospborylation in the A-loop 
include insulin-like growth factor 1 (IGF-1) receptor (Kato 
et al, 1994), fibroblast growth factor (FGF) receptor 
(Mohammadi et al, 1996a), hepatocyte growth factor 
receptor (MET) (Longati et al, 1994), nerve growth 
factor receptor (TRKA) (Mitra, 1991), and brain-derived 
neurotrophic factor receptor (TRKB) (Middlemas et ai, 
1 994). 

The crystal structure of the unphosphorylated, low 
activity form of the insulin receptor kinase domain (IRK) 
has been reported previously (Hubbard et al, 1994). This 
structure suggested an autoinhibitory mechanism whereby 
Tyrll62 in the A-loop competes with protein substrates 
for binding in the active site, while residues in the 
beginning of the A-loop restrict access to ATP such that 
m-autophosphorylation of Tyrn62 is prevented. Here, 
the crystal structure at 1.9 A resolution of the tris- 
phosphorylated, activated form of IRK (IRK3P) in com- 
plex with a peptide substrate and an ATP analog is 
presented. The conformational changes that occur upon 
autophospborylation provide an understanding at the 
molecular level of the role of autophospborylation in the 
activation of the insulin receptor and presumably other 
receptor tyrosine kinases. Moreover, the structure reveals 
the mode of peptide substrate binding to the insulin 
receptor kinase, and affords insights into the phospho- 
transfer mechanism. 

Results and discussion 

Structure determination 

A 306-residue fi-agment of the p-chain of the human 
insulin receptor which possesses tyrosine kinase activity 
was produced in a baculovirus/insect cell expression 
system (Wei et al., 1995). Cytoplasmic residues that are 
not included in the expressed protein are the first 25 
(juxtamembrane region) and the last 72 (C-terminal tail). 
Addition of MgATP to the purified kinase results in 
autophospborylation of Tyrl 158, Tyrll62 and Tyrl 163 as 
confirmed by mass spectrometry and microsequencing of 
tryptic peptides (Wei et al, 1995). Crystals belonging to 
trigonal space group P3221 were obtained fi'om a ternary 
complex consisting of IRK3P, an 18-residue peptide sub- 
strate and a non-hydrolyzable ATP analog, adenylyl imido- 
diphosphate (AMP-PNP). 

The structure of the ternary complex was solved by 
molecular replacement using the structure of IRK as a 
search model, and has been refined at 1.9 A resolution 
with a crystallographic 7?-value of 19.6% (6.0-1.9 A, 
F>2o). The atomic model includes all but the three 
N-terminal residues of IRK3P, one AMP-PNP molecule, 
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Fig. 1. Electron density map of the active site of IRX3P. Stereo view of a 2Ff^-F^ map computed at 1.9 A resolution and contoured at 1.2a. 
Superimposed is the refined atomic model. Carbon atoms are yellow, oxygen atoms red, nitrogen atoms blue, phosphate atoms purple, and sulfur 
atoms green. The red spheres represent water molecules and the white spheres represent Mg^"^ ions. Figure prepared with SETOR (Evans, 1993). 



Table 1. Data collection and refinement summary 


Data collection 
Resolution (A) 


Observations (N) 


Completeness (%) 


Redundancy (%) 


^sym 


Signal <//o(/)> 


20.0-1.9 


93535 


98.6 (90.6)^ 


4.1 


6.5 (15.3)^ 


12.9 


Refinement'^ 
Resolution (A) 


Reflections (N) 


/f-value {Vof 


Root- mean-square 


deviations 










Bonds (A) 


Angles O 


B-factors {k^f 


6.0-1.9 


25082 


19.4 (22.6)*" 


0.008 


1.5 


1.4 



^R,^ = lOOxIbkiL, I /.(hklH/Oikl)) /,(hkl). 
H'alue in parentheses is for the highest resolution shell. 

'^Atomic model includes 303 1RK3P residues, six peptide residues, one AMP-PNP molecule, two Mg'^"*" ions, 202 water molecules (2653 atoms), 
'^/i- value = lOOxZhul |Fo(h*<J)|- Fc(hkl)| l/Zhi,i|FoCh*«^0|. where and F^ are the observ-ed and calculated structure factors, rc^)ectively (Fq >2a>. 
T^or bonded protein atoms. 

Value in parentheses is the free /?- value determined from 5% of the data. 



two Mg^^ ions, six residues of the peptide substrate and 
202 water molecules. An electron density map in the 
vicinity of the active site is shown in Figure 1, data 
collection and refinement statistics are given in Table I, 
and a ribbon diagram of ternary IRK3P is shown in 
Figure 2A. 

Activation loop conformation 

In the unliganded IRK structure determined previously 
(Hubbard et al, 1994), the A-Ioop (residues 1149-1170) 
traverses the cleft between the N- and C-terminal lobes 
such that both protein substrate and ATP binding sites are 
occupied. Tyrll62 in the A-loop is bound in the active 
site, hydrogen -bonded to the putative catalytic base 
(Aspll32), and residues of the protein kinase-conserved 
1I50QPQ sequence in the beginning of the A-loop occupy 
the ATP binding site. Autophosphorylation of Tyrll58, 
Tyrll62 and Tyrll63 results in a dramatic change in the 
conformation of the A-loop (Figures 2B and 4A). Tyrl 158, 
for example, is displaced "-30 A from its position in the 
unphosphorylated A-loop. The conformation of the tris- 
phosphorylated A-loop permits unrestricted access to the 
binding sites for ATP and protein substrates. 

The tris-phosphor>lated A-loop is stabilized to various 



extents by the phosphotyrosine (pTyr) residues 
(Figure 4B). The phosphate group of pTyrll63 bridges 
the A-loop via hydrogen bonds to the side chain of 
Argil 55 on one side of the loop and to the backbone 
amide nitrogen of Glyll66 on the other side. In addition 
to the electrostatic interaction with Argil 55, the phenolic 
ring of pTyrll63 is packed against the aliphatic portion 
of the Argll55 side chain. Argll55 is also hydrogen- 
bonded to the carbonyl oxygen of the preceding residue. 
Arginine or lysine is highly conserved in the tyrosine 
kinase family at this position. The phosphate group of 
pTyrl 162 makes two hydrogen bonds to the side chain of 
Argn64. For receptor tyrosine kinases with tandem tyro- 
sine residues corresponding to Tyrl 162 and Tyrl 163 (e.g. 
TRK, FGF receptor), an arginine or lysine is often found 
at the Argl 164 position. The phosphate group of pTyrl 158 
makes no protein contacts and consequently is the least 
ordered of the three phosphate groups. 

Two short p-strand interactions also serve to stabilize 
the tris-phosphorylated A-loop. p9 in the A-loop is paired 
with p6, which precedes the catalytic loop (residues 1 1 30- 
1 137), and plO in the A-loop is paired with pi 2 between 
aEF and aF (Figures 2A and 3). This pairing scheme 
leaves pll, which in IRK was paired with plO, available 
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Fig. 2. enteral 1 view of IRJGP and comparison with IRK. (A) Ribbon diagram of the IRK3P structure. The a-helices are shown in red, the P-strands 
in blue, the nucleotide-binding loop in yellow, the catalytic loop in orange, the activation loop in green, AMP-PNP in black and the peptide substrate 
in pink. The termini are denoted by N and C (B) Superposition of the C-terminal lobes of IRK and IRK3P. The backbone representation of IRK/ 
IRK3P is colored orang& green, with the activation loop colored red/blue. The axis (black) and arrow (blue) specify the rotation reqmred to align the 
N -terminal P-sheet of IRK with that of IRK3P (C) Superposition of the p-sheets in the N-terminaJ lobes of IRK and IRK3P The backbone 
representation and carbon atoms of IRK 1RK3P are colored orange/green, oxygen atoms are red and nitrogen atoms arc blue. (A) and (C) prepared 
with RIBBONS (Carson, 1991), (B) with GRASP (Nicholls et al, 1991). 



for interaction with a protein substrate. Compared with 
the unphosphorylated A-loop, which was poorly ordered 
for residues 1152-1 157, the tris-phosphorylated A-loop is 
relatively well ordered. 

The conformation of the tris-phosphorylated A-loop is 
in general similar to that of the mono-phosphorylated 
A-loop of cyclic AMP-dependent protein kinase (cAPK) 
(Knighton etai. 1991 ), cyclin-dependent kinase 2 (CDK2) 
(Russo et al, 1996) and the Src-family tyrosine kinase 



LCK (Yamaguchi and Hendrickson, 1996). A comparison 
with these kinase structures reveals that pTyrn63 in the 
IRK A-loop is structurally related to phosphothreonine 
(pThr)197 in cAPK, pThrl60 in CDK2 and pTyr394 in 
LCK (Figure 4B-D). 

Although pTyrll63 is in a spatial position similar 
to pThrl97/pThrl60 in cAPK/CDK2, the number of 
hydrogen-bonding interactions in which pTyrll63 parti- 
cipates is far fewer than for the pThr residues (Figure 4B 
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Fig. 3. Secondary structure comparison of IRK and IRK3P and residue solvent accessibility. The secondary structure assignments for IRK and 
1RK3P were obtained using PROCHECK (Laskowski et al., 1993). Striped bars delineate a- helices and black bars delineate ^strands. The 
assignments for IRK appear on top and those for IRK3P on bottom. The three autophosphorylation sites in the activation loop are marked with an 
asterisk. The numbers under the amino acid sequeiKe represent the fractional solvent accessibility (FS A) of the residue in the IRK3P structure. The 
FSA is the ratio of the solvent-accessible surface area of a residue in a GXG tri-peptide to that in the IRK3P structure. A value of 0 represents an 
FSA between 0.00 and 0.09, 1 represents an FSA between 0.10 and 0.19, etc. The higher the FSA, the more solvent-exposed the residue. The FSAs 
were computed in the abseiKC of MgAMP-PNP, peptide substrate and water molecules. A # in the FSA line indicates that the side chain for that 
residue is not included in the atomic model due to disorder. 



and C). Argl 131, the residue which immediately precedes 
the putative catalytic base, Aspll32, is highly conserved 
in both the tyrosine and serine/threonine kinase families. 
In the cAPK and CDK2 structures (Knighton et al., 1991; 
Russo et al., 1996), the corresponding arginine is directly 
hydrogen-bonded to the pThr. Argl 131, however, is not 
observed to interact directly with pTyrll63. In the LCK 
structure, pTyr394 is also engaged in relatively few 
interactions, differing somewhat from those seen for 
pTyrll63 (Figure 4D), which suggests a susceptibility of 
these pTyr residues to the action of tyrosine phosphatases 
(Yamaguchi and Hendrickson, 1996). 

A comparison of the phosphorylated A-loops of IRK3P 
and LCK (one residue shorter) shows that the conformation 
is similar at the beginning of the loop, which includes the 
conserved ^*^*t)FG residues (^^^DFG), but diverges after 
pTyrll63 (pTyT394) (Figure 4B and D). Residues at 
the end of the LCK A-loop, including kinase-conserved 
Pro403 (Pro 11 72), are displaced further from the active 
site than the corresponding residues in IRK3R Interest- 
ingly, based on the mode of peptide substrate binding 
observed in the present structure (discussed below), the 
C-terminal end of the A-loop in the unliganded LCK 
structiu"e is not properly positioned for peptide binding. 

Lobe closure 

The rearrangement of the insulin receptor A-loop upon 
autophosphorylation facilitates a reorientation of the N- 
and C-terminal lobes of the kinase, which is necessary 
for productive ATP binding. In the structure of the 
unphosphorylated kinase domain of the FGF receptor, the 
ATP binding site is accessible, yet full engagement of 
ATP by the kinase does not occur due to interactions that 
impede lobe rotation (Mohammadi et al., 1996b). In the 
IRK structure, the lobes are held apart by stenc interactions 
between residues of the nucleotide-binding loop (1003- 
1008) and the "^^FG residues. When the C-terminal 
lobes of IRK and ternary IRK3P are superimposed, a 
nearly pure rotation of the N-terminal lobe of IRK by 



^21° aligns the five-stranded p-sheets (Figure 2B). The 
direction of the rotation is both towards the C-terminal 
lobe and parallel to the long axis of the molecule. Three 
pivot points in the rotation can be identified: two in the 
loop between aC and [J4 (near Phel054 and Argl 061) 
and one near the last residue of p5 (Glu 1077), just before 
the segment connecting the two lobes (between p5 and 
aD). The protein kinase-conserved glycine in the con- 
necting segment, Glyl082, does not appear to play a role 
in facilitating lobe movement. 

With respect to cAPK, the only other protein kinase for 
which structures of a ternary complex (with nucleotide 
and peptide) have been reported (Bossemeyer et al., 1 993; 
Zheng et ai, 1993), the lobes of IRK3P are rotated by 
~17°. Despite this difference, the actual lobe separation 
in the IRK3P and cAPK structures is similar as measured 
by the Ca-Ca distance between Vail 010 (Val57) and 
Met 1139 (Leu 173), which flank the ATP adenine. This 
distance is 12.8 A in ternary 1RK3P (15.0 A in IRK) and 
12.9 A in ternary cAPK. The lobe configuration in 
unliganded LCK (Yamaguchi and Hendrickson, 1996) 
is --^8° more open than that in ternary IRK3P, with a 
corresponding lobe-lobe separation of 13.9 A. 

Upon lobe closure, the p-sheet in the N-terminal lobe 
of IRK3P moves approximately as a rigid body. However, 
aC rotates an additional ^35° towards the C-terminal lobe 
(Figure 2C). This movement of aC places protein kinase- 
conserved Glu 1047 in proximity (4.4 A) to conserved 
Lysl030 in the ATP binding site. Three hydrophobic 
residues which lie along the same face of aC, Leu 103 8, 
He 1042 and Leul045, are solvent-exposed in the IRK3P 
structure, whereas the latter two residues are partially 
buried in the IRK structure (Figure 2C). These hydrophobic 
residues, which are not conserved in the tyrosine kinase 
family, are perhaps involved in protein-protein interactions 
specific to insulin receptor subfamily members. 

MgAMP PNP binding 

The entire AMP-PNP molecule is well ordered in the 
ternary IRK3P structure (Figure 1), indicative of a pro- 
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Fig. 4. Confonnation of the IRK3P A-lcx)p and comparison with other kinase A-loops. (A) Comparison of the A-loc^ conformations in IRK and 
IRK3P. The activation loop is colored green, the cata]>tic loop orange and the peptide substrate pink. The rest of the protein in each case is 
represented by a semi transparent molecular surface. Also shown is ANtP-PNR which is partially masked by the N-terminaJ lobe of IRK3P. Cartwfi 
atoms are colored white, nitrogen atoms blue, oxygen atoms red and phosphorus atoms yellow. View is -90° from that in Figure 2 A, from the right 
side (B) Selected hydrogen -bonding mteractions for activation loop residues of IRKJP arc shown as black lines. Atom coloring is the same as in 
(A) except that carbon atoms are gray. Coloring of the backbone representation is the same as in (A), Only a portion of the peptide substrate 
backbone is shown (P-1 to P+ 1). (C) Selected hydrogen-borxiing interactions for activation loop residues of c.\PK (Zheng et al, 1993) arc dwwn 
as black lines. Coloring same as in (B). with other backbone scgirients colored blue Only a portion of the p)eptide inhibitor backbone is shown (P-1 
to P + 1). (D) Selected hydrogen-bonding interactions for activation loop residues of LCK (Yamaguchi and Hendrickson. 1996) arc shown as black 
lines. Coloring same as in (B). Figure prepared with GRASP (Nicholls et ai, 1991) 
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Fig. 5. Interactions of MgAMP-PNP in the ternary IRKJP structure. 
The coloring of the backbone representation is the same as in Figure 
4B, with the nucleoti de-binding loop colored yellow. Atom coloring is 
the same as in Figure 4B, with sulfur atoms colored green and Mg 
ions colored purple. Selected hydrogen bonds are shown as thin black 
lines and bonds to Mg^'*' ions are shown as thick black lines. Figure 
prepared with GRASP (Nicholls et al., 1991). 



ductive binding mode. The interactions of AMP-PNP with 
IRK3P are illustrated in Figure 5. The adenine is situated 
between Vail 010 and Met 1139, and hydrogen-bonded to 
backbone groups of Glul077 and Met 1079. One of the 
nbose hydroxy 1 groups (02') is hydrogen-bonded to the 
side chain of Asp 1083, and the other (03') is hydrogen- 
bonded via a water molecule to the carbonyl oxygen of 
Argil 36 in the catalytic loop. The analogous interactions 
for the adenine and ribose are observed in the temary 
cAPK structure (Bossemeyer et al., 1993; Zheng et al, 
1993), except that 03' is hydrogen-bonded directly to the 
carbonyl oxygen of Glul70 (Argll36). 

The interactions of the nucleotide phosphate groups 
with IRK3P residues differ from those observed in the 
ternary cAPK structure. In cAPK, the nucleotide-binding 
loop is hydrogen-bonded via three backbone groups to 
the and y-phosphates of ATP (Bossemeyer et al., 1993; 
Zheng et al, 1993). In IRK3P, only one hydrogen bond 
is observed, between the backbone amide nitrogen of 
Serl006 and the p-phosphate (Figure 5). Moreover, in 
cAPK, protein kinase-conserved Lys72 is hydrogen- 
bonded to the a- and p-phosphates, whereas in IRK3P, 
Lysl030 is hydrogen-bonded to just the a-phosphate. 

The greater extension of the nucleotide-binding loop 
over the phosphates in temary cAPK is probably due 
to hydrogen-bonding between the loop and the peptide 
inhibitor PKI (Bossemeyer et al, 1993). In the temary 
IRK3P structure, no interactions are observed between the 
nucleotide-binding loop and the peptide substrate. Owing 
to the longer side chain of tyrosine versus serine/threonine, 
the peptide substrate is positioned significantly further 
away from the N-terminal lobe in IRK3P than in cAPK 
(Figure 4B and C). It is likely, therefore, that the lack of 
interaction with the nucleotide-binding loop is a general 
feature of protein substrate-tyros ine kinase interactions. 

Two well-ordered Mg^"*^ ions have been included in the 



atomic model based on the heights of the electron density 
peaks and the number and distances of coordinating atoms 
(Figures 1 and 5). Mgl shows nearly ideal octahedral 
coordination by six oxygen atoms (Mg O = 2.0-2.2 A): 
one from each of the P- and y-phosphates, one from each 
of the side chains of protein kinase-conserved Asnll37 
(catalytic loop) and Asp 1 1 50 (A-loop), and two from well- 
ordered water molecules (Figure 5). Mg2 has seven oxygen 
ligands (Mg O — 2.4—2.6 A): the same p-phosphate 
oxygen that coordinates Mgl, both carboxylate oxygens 
of Aspl 1 50, and four water molecules. Mgl and Mg2 are 
in positions similar to those of the inhibitory and activating 
metal ions, respectively, in temary cAPK (Bossemeyer 
et al, 1993; Zheng et al, 1993), but the coordination by 
the phosphate groups differs. Mgl is coordinated by the 
p- and y-phosphates, whereas the inhibitory metal ion in 
temary cAPK is coordinated by the a- and y-phosphates. 
Mg2 is coordinated by just the p-phosphate, whereas the 
activating metal ion in temary cAPK is coordinated by 
the p- and y-phosphates. The coordination environment 
of Mgl is strikingly similar to that of the Mg^^ ion in 
the structures of GTPases such as Ras (with MgGMP- 
PNP; Pai et ai, 1990) and transducin-a (with MgGTPyS; 
Noel et al, 1993), wherein the Mg^^ ion is octahedrally 
coordinated by the p- and y-phosphates, two side-chain 
oxygens, and two water molecules, in the same spatial 
configuration as found in the temary IRK3P structure. 

Peptide substrate binding 

The 18-residue peptide substrate that was co-crystallized 
with IRK3P and MgAMP-PNP was derived from a putative 
insulin receptor phosphorylation site on rat IRS-1, Tyr727 
(Sun et al, 1991). This peptide, ICKKLPATGDYMN- 
MSPVGD, has a reported of 24 ^iM (Shoelson et ai, 
1992). Of the 18 residues, supporting electron density is 
seen for only six, from the P-2 to the P+3 residue 
[GDYMNM; P-residue is the acceptor tyrosine, Tyr(P)]. 
The residues C -terminal to Tyr(P) form a short p-strand 
which pairs in an anti -parallel manner with residues of 
pll at the end of the A-loop (Figures 2A and 6B). 
Backbone hydrogen-bonding is observed between 
Mct(P4-]) and Leull71, and between Met(P+3) and 
Glyn69. 

Tyrosine residues in YMXM motifs are efficient sub- 
strates of the insulin receptor in vitro and in vivo (Shoelson 
et al, 1992; Sun et al. 1993; Songyang et al., 1995). In 
the temary 1RK3P structure, the methionine side chains 
of the peptide fit into two adjacent hydrophobic pockets 
on the surface of the C-terminal lobe, composed of residues 
from the P+1 loop (1171-1176) and from aEF and aG 
(Figure 6). The pocket for Met(P+l) comprises Vail 173, 
Leul219 and the aliphatic portions of the Asnl215 and 
Glul216 side chains. The pocket for Met(P+3) comprises 
Leull71,Vani73,Metll76, LeullSl andLeul219. These 
pockets appear to be optimized for accommodating long 
hydrophobic side chains. 

The temary IRK3P structure provides a basis for umfer- 
standmg tyrosine kinase substrate specificity. In a study 
of substrate specificity using a degenerate peptide library, 
the cytoplasmic tyrosine kinase Fps was shown to have a 
strong preference for glutamic acid at the P+ 1 position 
of a peptide substrate (Songyang et al, 1995). Of the four 
insulin receptor residues that form the binding pocket for 
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Fig. 6. Mode of peptide substrate binding. (A) Overall view of peptide substrate binding to IR1C3P. The molecular surface of IRK3P is colored 
according to electrostatic potential (calculated in the absence of MgAMP-PNP and peptide). Positive potential is shown in blue, negative potential in 
red and neutral jwtential in white. Carbon atoms are green, oxygen atoms are red, nitrcigen atoms are blue, sulfur atoms are black and phosphorus 
atoms are yellow. The doned line encompasses Lysl085, ArglOS^ and Argl092 of oD View is approximately the same as in Figure 4 A. (B) Stereo 
view of the interactions between the peptide substrate and 1RK3P. Oxygen atoms are red, nitrogen atoms are blue, carbon atoms of IRK3P/peptide 
are orange/green and sulfur atoms of IRK3P/peptide are greea'black. The molecular surface is semi-transparent, and selected hydrogen bonds arc 
shown as white lines. The red sphere represents a water mc»lecule. Figure prepared with GRASP (NichoUs et al , 19^1). 



Met(P+l), three are identical or similar in Fps, while the 
fourth, Leul219, is an arginine in Fps (Arg770). Modeling 
suggests that an arginine at the Leul219 position could 
make a hydrogen bond with a glutamic acid at the P+ 1 
position of a peptide substrate. In the receptor tyrosine 
kinase RET, the substitution Met918— >Thr leads to 
multiple endocrine neoplasia type 2B (Hofstra et al., 
1994), and is thought to alter the substrate specificity of 
RET (Songyang et al, 1995; Pandit et al , 1996). The 
corresponding methionme in the insulin receptor, Metl 1 76, 
is a constituent of the Met(P+3) bmding pocket. 

Substrates of the insulin receptor and other tyrosine 
kinases often contain one or more acidic residues 
N-terminal to the acceptor tyrosine (Hunter, 1982; 
Stadtmauer and Rosen, 1983; Songyang et al., 1995). The 
co-crystallized peptide contains an aspartic acid at the P- 
1 position. In the crystal structure, Asp(P-l ) is not directly 
hydrogen-bonded to residues of the kinase, but instead 
makes a water-mediated hydrogen bond to Lysl085 
(Figure 6B). Two other positively charged residues, 
Argl089 and Argl092, extend from the same face of aD 
as Lysl085 towards the substrate binding region (Figure 
6 A). At the position corresponding to Argl089, a large 
majority of tyrosine kinases contain a hydrogen-bond 
donor which is often positively charged. This residue may 
play a role in the recognition of protein substrates with 
acidic residues at the P-3 or P— 4 position. 



Active site 

The hydro xyl group of the substrate t>Tosine, Tyr(P), is 
hydrogen-bonded to the carboxylate group of Asp 11 32 
(Or| 052 = 2.7 A), and thus appears to be in position 
for proton abstraction by Aspll32 (Figures 4B and 5). 
The hydroxy 1 group is also hydrogen -bonded to tyrosine 
kinase-conser\'ed Argil 36 in the catalytic loop 
(Or| Ne = 3.1 A). The phenolic ring of Tyr(P) is oriented 
in part by van der Waals interactions with tyrosine kinase- 
conserved Proll72 (P+1 loop), positioned 3.6 A from 
the ring (Figures 4B and 6B). 

Based on the autoinhibited IRK structure, a difference 
in the conformation of the P+1 loop near Proll72 
appeared to be a major determinant in conferring specificity 
for tyrosine versus serine/threonine (Hubbard et al, 1994; 
Taylor et al, 1995). This is confirmed by the ternary 
IRK3P structure, in which backbone hydrogen -bonding 
between the peptide substrate and the P+ 1 loop of the 
kinase governs the distance between the peptide backbone 
and the kinase active site (Figure 6B); the side cham of 
tyrosine but not of serine/threonine is long enough to 
reach the active site. 

In the ternary IRK3P structure, the y-phosphorus atom 
of AMP-PNP is 5 0 A from the hydroxy 1 oxygen of Tyr(P) 
(Figure 5). and is therefore not in position for direct in- 
line (associative) phosphotransfer, the catalytic mechanism 
proposed for cAPK (Madhusudan et al , 1994). The use 
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of AMP-PNP in co-crystallizations (to preclude substrate 
phosphorylation) could introduce a deviation in y-phos- 
phate positioning, although no significant differences in 
nucleotide binding are seen in the structures of cAPK 
with ATP (Zheng et al, 1993) or with AMP-PNP 
(Bossemeyer et ai, 1993). The present structure is the 
first to be reported of a complex between a protein 
kinase, a peptide substrate, and a y-phosphate-conlaining 
nucleotide. Trapping the enzyme with both the non- 
transferable y-phosphate and acceptor hydroxyl in position 
for associative phosphotransfer may be inherently difficult. 
Alternatively, in view of the difference in the p/Ca values 
of tyrosine (~10) and serine/threonine (^13), and the 
presence in the catalytic loop of an arginine (tyrosine 
kinases) rather than a lysine (serine/threonine kinases), the 
structure may suggest a dissociative or mixed associative- 
dissociative phosphotransfer mechanism (Bramson et al, 
1984) for tyrosine kinases. 

Functional implications 

The crystal structures of IRX and IRK3P afford an 
understanding at the molecular level of insulin receptor 
activation via autophosphorylation. The IRK structure, 
together with mutagenesis data showing that substitution 
of Tyrl 162 with phenylalanine increases basal level kinase 
activity (Ellis et al, 1986), suggests that residues of the 
unphosphorylated A-loop compete with ATP and protein 
substrates for binding at the kinase active site. Upon 
insulin-triggered rra/i^-autophosphorylation of the tyrosine 
residues within the A-loop, this peptide segment adopts a 
markedly different conformation which is stabilized by 
both pTyr and non-pTyr interactions. For the insulin 
receptor and presumably other receptor tyrosine kinases, 
stimulation of kinase activity by autophosphorylation is 
evidently a consequence of stabilization of an A-loop 
conformation that (i) allows unhindered access to ATP 
and protein substrates, and (ii) facilitates the correct 
positioning of the residues critical for MgATP binding 
and catalysis. 

The bridging interactions of pTyrl 1 63 with other A-loop 
residues and its similar spatial position to pThrl97 in 
cAPK imply that pTyrl 1 63 is the key pTyr in stabilizing the 
conformation of the tris-phosphorylated A-loop. Although 
studies of Tyr— >Phc substitutions in the insulin receptor 
A-loop have not demonstrated a predominant role in 
kinase activation for a particular tyrosine (Tavare and 
Siddle, 1993), the tyrosine corresponding to Tyrl 163 has 
been shown to be the critical autophosphorylation site in 
the FGF receptor (Mohammadi et al, 1996a) and in MET 
(Longati et al, 1994). 

Autophosphorylation mapping experiments have indi- 
cated that Tyrl 163 is the last of the three A-loop sites to 
be phosphorylated (Dickens and Tavare, 1992; Wei et al, 
1995). This result, together with the mutagenesis data and 
the structtiral results presented here, is consistent with a 
graded activation mechanism in which each autophos- 
phorylation event in the A-loop leads to partial stabilization 
of the active configuration of the A-loop, with full activa- 
tion achieved only upon autophosphorylation of pTyrl 163. 
Conceivably, in partially activated (mono- and bis-phos- 
phorylated) states, pTyrll58 and pTyTll62 are engaged 
in a different set of interactions than those observed in the 
IRK3P structure. For example, pTyrll62 may substitute 



structurally for pTyrll63 in the bis-phosphorylated state, 
though presumably less effectively. 

The lack of engagement of pTyrl 158 with other kinase 
residues and the solvent accessibility of the pTyrll62/ 
Argll64 pair suggests that these pTyr residues could 
potentially serve as docking sites for downstream signaling 
proteins. Several recent studies have provided evidence 
that one or more of the A-loop pTyr residues are involved in 
protein-protein interactions. Yeast two-hybrid experiments 
have identified an interaction between a region in IRS-2 
and the cytoplasmic domain of the insulin receptor, which 
is dependent on tyrosine phosphorylation in the A-loop 
but not in the juxtamembrane region or C-terminal tail 
(He et al, 1996; Sawka-Verhelle et al, 1996). Other yeast 
two-hybrid experiments together with in vitro binding 
studies have provided evidence for interactions between 
the cytoplasmic domain and the Src-homology 2 (SH2) 
domains of GRBIO/IR-SVI (O'Neill et al, 1996; Frantz 
et al, 1997) and SHP-2 (Kharitonenkov et al, 1995), 
which are dependent on phosphorylation of one or more 
of the A-loop tyrosine residues. The IRK3P structure 
provides evidence, albeit circumstantial, that one or more 
pTyr residues of the insulin receptor A-loop may serve as 
targets for downstream signaling molecules. 

Materials and methods 

Expression, purification and crystallization of IBK3P 

A recombinant baculovirus was engineered to encode residues 978-1283 
of the human insulin receptor with the inclusion of two amino acid 
substitutions: Cys981-^Ala and Tyr984->Phe (Wei et al, 1995). Expres- 
sion and purification of IRK have been described previously (Hubbard 
et al., 1994). Typically, IRK3P was produced by adding ATP and MgC^ 
(10 mM and 25 mM final, respectively) to IRK (-1 mg/ml final) at 4°C. 
After 20-30 min, the autophosphorylation reaction was terminated by 
the addition of EDTA (50 mM final). The reaction mixture was passed 
over a Superdex 75 (Pharmacia) gel filtration column to remove 
nucleotides, Mg^"^ and EDTA. The mainly tris-phosphorylated kinase 
species was separated from the bis-phosphorylated species by Mono Q 
(Phamiacia) ion exchange chromatography. Purified IRK3P was concen- 
trated to 10-15 mg/ml using a Centricon-10 (Amicon) in 20 mM Tris- 
HCI, pH 7.5, 200 mM NaCl and 200 (iM Na3V04. Solid-phase synthesis 
of the 18-residue peptide substrate was performed on an Applied 
Biosystems model 741 A synthesizer using Fmoc chemistry, and AMP- 
P>4P was purchased fix»m Boehringer-Mannheim. 

Crystals were grown at by vapor diffusion in hanging drops 
containing 2.0 ^1 of protein solution (10 mg,'ml IRK3P, 2 mM AMP- 
PNP, 6 mM MgCl2 and 0.5 mM peptide substrate) and 2 .0 ^il of reservoir 
solution [22% polyethylene glycol (PEG) 8000, 100 mM Tris-HCl, 
pH 7.5, and 2% ethylene glycol]. The crystals belong to trigonal space 
group P3221 and have unit cell dimensions of a = b ~ 66.5 A. c = 
139.1 A when frozen. There is one molecule in the asymmetric unit and 
the solvent content is 50% (assuming a partial specific voimnc of 
0.74 cm^/g). 

Data collection, structure determination arni analysis 

One cryo-cooled crystal was used for data collection. The crystal was 
transferred into a cryo-protectant containing 25% PEG 8000, 1 mM 
AMP-PNP, 3 mM MgC!2, 0.25 mM peptide substrate, 100 mM Tns- 
HCl, pH 7.5 and 15% ethylene glycol. After ~1 min in the cryo- 
protectant, the crystal was flash-cooied in liquid nitrogen and then 
transferred to the goniostat, which was bathed in a dry nitrogen stream 
at -I60°C. Data were collected at beamline X4A at the National 
Synchrotron Light Source. Brookhaven National Laboratory, on Fuji 
image plates and digitized with a Fuji scanner. All data were ^Mxx^cssed 
using DENZO and SCALEPACK (Otwinowski, 1993). 

A molecular replacement solution was found with a search molecule 
consisting of IRK residues 981-1283, exchidmg 1149-1170 (A-loop), 
for which the N-terminal lobe (residues 981-1081) was rotated and 
translated as a rigid body so that the relative orientation of the N- arid 
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C-temrunaJ lobes was similar to that in closed-form cAPK (Zheng et al., 
1993). An initial 3.0 A data set collected on a Rjgaku R-.\XIS IIC 
image plate detector w'as used for molecular replacement (data not 
shown). With AMoRe OJavaza, 1994), using 80*?^© of the structure factor 
amplitudes between 15.0 and 3.5 A, the correlation coefficient for the 
correct solution in space group P3221 was 0.46 versus 0.34 for the 
highest incorrect solution in the enantiomeric space group P3i21. The 
AMoRe solution was refined in X-PLOR (Briinger, 1992), first as one 
rigid-body unit, then as two units each comprising a lobe of the kinase 
(981-1081, 1082-1283), resulting in a relative rotation of the two lobes 
of -13'^ and an increase of the correlation coefficient from 0.33 to 
0.41 (10.0-3.4 A, F>2a). Simulated annealing and conjugate- gradient 
minimization were performed using X-PLOR, and mode! building was 
performed using TOM/FRODO (Jones, 1985). 

The average atomic B- factors are 18.5 A^ for protein, 15.8 A^ for 
MgAMP-PNP, 21.9 A^ for peptide and 25.3 A^ for water molecules. 
Due to poor supporting electron density, residues 978-980 are not 
included in the atomic model, nor arc the side chains indicated in Figure 3. 
Atomic superpositions were performed with TOSS (Hendrickson, 1979). 
Per residue solvent accessible surface calculations were done with 
X-PLOR with a probe radius of 1.4 A As defined in PROCHHCK 
(Laskowski et al., 1993), the backbone torsion angles of the protein 
lie either in most favored regions (93%) or in additional allowed 
regions (7°/o). 
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